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Process foy Data Distribirtion thr ott^li a Network 
Technical field of the mveXKtlon 

5 The inveaitiott relates to data coixnnunication and niOfe particularly to a process for 

disseminating or distributing data ^ough a network. 

Background art 

10 The constant progress of the telecommunication systems, particularly with the explosion of 

the Internet and intranet networks, has resulted iij the development of an era of information. With a 
single personal computer, it is possible to get a connection to the Internet network, and have direct 
access to a wide range of information, services and dectronic documentation. More and more 
publishers provide on-line electronic documentation, books> reviews and all sorts of electronic media 

15 files to customers connected to the network. 

As die size of the Tnteniet or a private intranet network increases, the need for distributed 
replication of an electronic file increases also. This capability is essential in a wide variety of 
situations. For instance, this is required when deploying large execution sets on all the machines of a 
20 cluster is planned and also the num^ous situations of replication of weh data from a Web server to 
Web caches. When a new version of an operating ^stcm or other program is m^de available there is 
an coormous need for quick replication of the software on a large number of different machines, 
which needs to be achieved without excessive stress on liie web server and the telecommunication 
network. 

25 

Generally speaking, the transmission of a given file and its replication at different locations 
' within a network is normally performed in a master/slave mode, that is to say in a manner which 
concentrates the burden of the file copy on one central distribution point. In most situations, when the 
size of the electronic file to be transmitted is significant, and when there are a large number of 
30 recipients, this results in stress on both the master device — e.g. a web server — and also on the 
network- As an illustration, shouM one file of 500 Kbytes be provided to a numbei" of 1 000 
recipients by one web server, the replication of the 1000 files will entail a trafllc of 500 Mbytes for 
that server and for the communication path. 

35 A technique known as TP multicast can in some circumstances be used to improve tlie 

situation but this, however, is not available on die public Internet network, because currently most 
routers do not support the Internet Group Management Protocol (IGMP)- 
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The use of a proxy can in some circiimstanc?es relieve both the publisher of tiie electronic file 
axid the netwoik. As is well known, a proxy device functions to avoid direct connection from one 
piachine - generally within an Intranet network - to another machine being outside the Intranet 
5 nct^vork. The conmiimication between the first and the second machine is achieved via the proxy 
which receives and forward all tlie requests on behalf of the former to the latter- 
Figure 1 illustrates the topology of a Intranet network which comprises a set of netvvork 
devices - e.g. a router 9 allowing communication between a first subnetwork A and a second 

10 subnetwork B comprised within the Intranet, but also the computers 6 and 7, a printer 8 and an 
intranet server 10. All the network devices of the Intranet communicate with the Internet network 3 
via a proxy element 5 allowing interface between the boundary of flae Intranet and the Internet It can 
therefore be seen that when a proxy is arranged within Tnti-anet network of a private organization 
for instance, all the requests from the machines of that Intranet network are first forwarded to the 

15 proxy, which accesses tlie remote machines and returns the results to the original requestor. Typically 
proxies are hnked with other fiinctionahty, in particular the cacliing of web pages to reduce tralDc so 
that one electronic file requested by a great number of machines belonging to one Intranet network is 
directly available without requiring multiple download operations from the publisher of the electronic 
files. Therefore, if one device - e.g. computer 6 requests one electronic file from an external web 

20 server 1, the latter can be cached within Proxy 5 for later diffusion within the Intranet, 

Summary of the invention 

This invention is directed to a process for distributing through a network an electronic 
25 message associated with a list of recipients, said process comprising the iterated application of tlie 
steps of: 

a) extracting from said list a first sublist and a second sublist of recipients; 

b) identifying a recipient witiiin said first sublist which can be addressed via said network, and 

c) ti-ansmitting said electronic message and said first sublist to said identified recipient for onward 
30 distribution; and 

d) applying said steps (a) to (d) to said second sublist of recipients. 

The process can also comprise receiving the message and a list of recipients for onward 
distribution. The above described process can thus be cairied out in a plurality of nodes of the 
35 network upon receipt at each node of the message and an associated first sublist generated in another 
. of the nodes until the message has been transmitted to all recipients On the list. 
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This process permits the burden of the transmission of the eleclronic document to be shared 
between different recipients, each of which reapplies the same procc^ss and reiterates using the 
original list of recipients until the eleclronic file or document is completely processed and transmitted 
to every recipient Preferably, the two suWists are arranged to have a similar size and together 
5 comprising all the recipients listed in the list of recipients. 

m at ieasi preferred cmbuuiuicnib, swU iucs»s>a}^w. auu. uiC "Sist of rcc-ipicntc axe ccTit^ined in ?. 
package^ ihe step of transmitting the electronic message to the identified recipiexnt comprising 
generating a new package comprising the electronic message and the first list. The package can he 
)0 .an XML document for instance, and can comprise information defining tiie size and the date of 
creation of said electronic message. 

In one embodiment, the transmission of 1ko electronic package comprising the sublist is 
achieved via an HTTP channel, thus allowmg easy transmission through the proxy of the Intranet 
15 networks. 

in one embodiment, the process is executed by an agent which is embodied by a 
computer program having ics own installation procedure. Alternatively, the agent can be a specific 
component of an operating system allowing effective publishing functionality. 

20 

The invention provides a computer program product which permits distribution or flooding 
of the information throughout a teleconmnunioation netv/ork, such as an Interaet^intranet network for 
iostanoe. 

2S Description of the drawings 

An embodiment of the invention wiD now be described, by way of example only, with 
reference to the accompanying drawings, wherein: 

30 Figure I illustrates a network including a a proxy system. 

Figure 2a and 2b respectively illustrate a computer system tor performing a distribution 
process and a distiibution control agent 25. 

35 Figure 3 illustrates the topology of five computers providing the distribution of an electronic 

file. 
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Figure 4 illustrates the process executed by a publislier. 

Figure 5 illustrates the process executed by a control agent of a network devieer 

5 Figure 6 illustrates tlie execution of the process on a topology of eight devices. 

Description oi'tbe preicrrea eniDoaimeni. of liic iuvtiiiuuAl 

With reference to figure 2 there is shown a block diagram of a system which permits 
10 distribution of electronic files. The system is illustrated as residing within a computer system 20 
which may be a personal computer or my conventional suitably arranged conopxrter system. The 
system is arranged to use existing electronic circuitry to perform the functions which will be 
described hereinafter. 

15 The computer 20 is illustrated as including z memory device 21 which may be a random 

access memory (RAM), Read-Only-Memory (ROM), a conventional mass storage, such as a hard 
disk drive or the like. The mcn^ory device 21 is conventionally capable of storing any type of data, 
but is is particularly used in the present situation for achieving local replication of electronic files or 
electronic messages as will be explaiDcd hereinafter with details. In the following the term electronic 

20 messages will be used for simplicity to refer to the data that i$ to be disseminated within the network 
in whatever form and which may be received and stored within the memory deface 21. In typical 
applications tliis data may be for instance be files such as word processed Or other documents, 
program files or media files (mp3, asx, pdf) files in accordance with known standard formats, but the 
application of the techniques described is not limited to any particular type of file, message or data. 

25 For instance^ the process woiUd be particularly useful for distributing media files such as electronic 
newspapers winch need to be disseminated very rapidly. Tlie mass storage may usefully be arranged 
to include a specific storage area (not shown) which is reserved for receiving the electronic files 
received and stored in accordance witii the process which will be described with reference to figures 
4 and 5. 



30 



35 



The computer 20 is fturfher illustrated as including processing circuitry 22 \?virich generally 
governs the flow of software instmctions and data within the computer system. In figure 2, the 
processing circuitry 22 is particularly arranged to provide readAvrite access to the memory device 21. 
Computer 20 further includes conventional input/output devices 23. 

Computer 20 further coraprises communication circuitry 24 The communication circuitry 24 
may be, for instance, a modem allowing the computer 20 to communicate via a telecomraunication 
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nolwork, such as an Internet network 3. It should be understood, howevcaf, that the type of 
communication circuiliy with which computer 20 commUTiicates with the Internet network - for 
instance to access a web server 1 with a database 2 - is not important. In the case of an Internet 
connection, a TCP/IP communication layer is provided for supporting the coniinmication layers 
5 which are involved in the communication steps of the process described herein. In most cases» the 
access to the Intcmet network is achieved via the World Wide Web via the well-knoxvn Hyper Text 
IranSTer rrotocoi (Hrfr). vvlulc HTTr U vci^ u*cfal, pr.iticiLbrly bscousc it ie v/^VL ?.d*p+'=^'1 *n ihe: 
use of firewalls, any other kind of protocol allowing communicalion via the network can equally be 
used. 

10 

In addition to the conventional software components which are used for controUing the 
processing circuitry 22, and which generally includes the operating system and a set of application 
software, the computer 20 is provided with a distribution control agent 25 which consists of a set of 
processing instructions stored within the memory 21 and which are used for controlling the sequence 
15 of operations which will be described below. 

In this embodiment, the distribution control agent 25 is a specific software component 
provided with and embedded within the operating system installed within the machine. However, it 
will be understood that Ihe functions described may be adapted to other configurations, such as in the 
20 form of a separately installed application program. 

In Ihis embodiment, distribution control agent 25 comprises two distinctive components: an 
emitter element 27 associated with a receiver clement 26, as shown in figure 2b. The emitter element 
27 is arranged to communicate with corresponding receiver elements 26 of other computers which 
25 .are accessible via the network. 

With respect to figures 3 and 4 (here will now be described the process for distributing a 
message. Computer 31 - including a distribution control agent 25 with one emitter (E) element and 
one receiver (R) element - is assumed to transmit an electronic message to a set of recipients 

30 including recipient 32, 33, 34 and 35 shown in the figure 3. Recipients 32, 33, 34 and 35 are 
generally conventional computers such as illustrated in figure 2, but could also be more specific 
network devices allowing Interriet access, such as mobile telephones, communicating wrist watches 
or similar appliaces, for instance, Tt is assumed tliat any device can communicate via any other 
device through the network, which is generally tlie case when each device is configured as a nenvork 

35 node provided with an Internet Protocol (IP) address. 
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Figure 4 more particularly shows processing steps which are involved for achieving 
distribution of the messages. This process is launched by publisiiiBg server 3 1, for instance. 

In a step 41, distribution control agent 25 receives an electronic message to be distributed, 
5 the latter beiag accompanied by a list of recipients (e.g. nodes 32-35 of figure 3). As mentioned 
above, the electronic message can be received from an application program running within the 
computer. 

Upon reception of die message accompanied by the list of recipients, Agent 25 creates in a 
10 stq? 42 an electronic package containing the message to be distributed and the above mentioned list 
of recipients. In the case of a TCP/IP network, the recipients are defined by the TP addresses or by the 
CTfiail addresses. Preferably, the electi-onic paclcage which is created at step 41 is an XML tile which 
COiXJplies with the standard eXtcnded Mark-up Language structure and which is associated with a 
Document Type Defixiition (DTD) file. The XML file preferably comprises additional ititbrmation 
15 such as the date of creation of the electronic file. 

In a step 43, Agent 25 forwards the XML package to its receive control element 26 for 
* furlher processing. 

20 The processing which follows after step 43 is the general distribution processing which is 

depicted in figure 5 and which will now be described in detail. 

The process is launched when the receive control element 26 of agent 25 receives inastcp 51 
an XML package which is to be processed. 

25 

Step 51 is then followed bv a step 52 where agent 25 reads the container comprising the 
package and extracts the list of recipients thereti included. If necessary. Agent 25 then removes 
itself from the list of recipients (not shown). 

30 In a step 53> the agent 25 performs a test to determine whether the list of recipients is empty, 

in which case, the process proceeds to a step 59 which is the completion of the process. 

However, in the general case, the list of recipients is not empty, and the process proceeds to a 
step 54 where agent 25 computes from the list of remaining recipients received in step 5 1 a set of two 
35 distinctive subJists: a first and a second sublist. First and second sublists share no common element 
and their joining permits the orighral list of recipients which was included within the original 
package to be regenerated. In one embodiment, the two sublists are arranged to have substantially 
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the same size* More particiilarly , in the preferred embodiment, the first list has a number of elements 
which is set to be equal to the integer which is iraroediateiy superior to half the nm[nbeT of recipients 
of the original list With respeet to the scheme of figure 3, it can be seen that sublist I can include 
nodes 32 and 34, while sublist 2 includes nodes 33 and 35. fo the particular case where the list is 
5 limited to a single recipient, the process divides the list into a first and a second list, the second one 
being empty. 

In a sten 55. agent 25 determines via transmit control element 27 one particular recipient 
within the first sublist ^vhich can receive a notification via the network, for instance node 32 of 
10 sublist 1. Different methods and means can be arranged for achieving the above determination. 
Preferably, agent 25 issues a PING command to determine whether any particular recipient is 
available on the network. 

When the above determination completes, the agent 25 generates in a sten 56 a first 
15 subpackage comprising the original electronic document to distribute, accompanied with the first 
sublist within an XML file which is formatted in a similar fashion to die original electronic package 
which was created in step 41 of figure 4. In the example of figure 3, it can be seen that first subhst 
includes nodes 32 and 34. While figure 5 clearly illustrates that the determination of the particular 
recipient (step 55) precedes the generation of the electronic package, it should be noted that the 
20 generation of the electronic package could be carried out before the determination of the appropriate 
recipient to which the package will have to be transmitted. 

hi a step 57. the first subpackage is then fomarded to flie transmit control element 27 for 
further propagation through the network to the recipient which was identified in step 54. This 

25 transmission is particularly illustrated m figure 3 by arrow 36 showing the propagation of the first 
subpackage from node 31 (the publishing node) and node 32. Preferably, the HTTP protocol is used 
for tins transmission to &cilitate the transmission through firewalls. Alternatively, this transmission 
can be achieved via the File Transfer Protocol or any oOier suitable protocol. Once the recipient 
receives tlie first subpackage, the latter will correspondingly initiate the general distribution process 

30 of figure 5. 

In a step 58, agent 25 generates a second subpackage comprising the original electronic 
document or message, accompanied by the second sublist which is formatted within a XN'IL file and 
forwarded to the receive control clement 26 of the same node and same agent. In the example drawn 
35 in figure 3, it can be seen that the secoud package has a second sublist including nodes 33 and 35. 
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The pnrocess then proceeds back to step 51 for the puipose of processing this new electronic 
package in accordance wifh 1iie sequence of steps 51-58 which were described above. This permits a 
new pair of subKsts to be computed, respectively a sublist cotitaining node 35 attd a second sublist 
containing node 33 permitting later transmission of the electronic file. Node 31 will therefore 
5 successively transmit the elect-onic document to node 35 (arrow 38) and then to node 33 (arrow 39). 
Node 34 will directly receive the electronic file from a direct transmission (arrow 37) from node 32. 

It should be noted that tlie process of figure 5 which was described m detail above, is 
executed in each agent of the different machines which are involved in tlie distribution process and 

10 which receive the package. Therefore, the process is successively executed in the different recipients 
until the full distribution of the original document or file is achieved. Furtiier, it can be seen that the 
number of replications of tiie original electronic documents is shared between the ditTercnt recipients 
(ic nodes 3 1 and 32 in figure 3) of the different consecutive sublists which are computed. Neither the 
originating publishing server, nor tlie network is disproportionately stressed or loaded by the 

1 5 distribution of the document* even if a great number of recipients is to be reached. 

More particularly, it can be seen that the number of successive downloads which is to be 
managed and haadled by the publishuig server is decreased in a ratio of logZ wi1h respect to the 
number of machines which are to receive the original message. This can be seen with more clarity in 

20 figure 6 illustrating the process for 7 nodes. Assuming that the original list of recipients includes 
nodes 61, 62, 63, 64» 65, 66 and 67, it can be seen that the first sublist which is generated in step 54 
of figure 5 comprises nodes 62, 66 and 67 while the second sublist which is generated includes 61, 
63, 64 and 65. When the first subpackage (containing the first sublist) is transmitted to node 62, the 
originating node 61 only has to process a second subpackage comprising a second sublist which is 

25 reduced to contain nodes 61, 63, 64 and 65. 

At the next loop, when node 61 receives flie second subpackage, a new iterative process is 
executed, what results in the computation of a new set of two subpackages, respectively containing 
nodes 64-65 and nodes 63, and the transmission of the former for flirther processing to node 64. It 
30 can then be seen that node 61 no longer has to transmit the electronic file to node 65 because node 65 
receives the message via from the processing of the above mentioned subpackage by node 64. 

While the invention was particularly described wifh reference with a computer, it should be 
clear that the process may be adapted to any device which can communicate with the network^ and 
35 which can receive an electronic message. Such devices may include mobile telephones and more 
generally any device which can be fitted wifh a distribution control agent, such as agent 25, for Che 
purpose of executing the steps of process illustrated in figure 4 and 5. 

& 
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CLAIMS 

1. Process for (iUstributmg through a network an electronic message associated ivilii a list of 
reoipirats, said process comprising the iterated application of the steps of : 

5 

a) extracting (54) from said list a first sublist and a second sublist of lecipieDts; 

b) identtlymg a recipient wimin said fu&i i>uvji:>'i wmvli car^ be addiccced vi:: said rs^/ork, 

c) transmitting said elcctromc message and said fest sublist to said identified recipient for onward 
distribution; and 

10 d) applying said steps (a) to (d) to said second sublisi of recipients. 



2. Process as claimed in claim 1 comprising receiving the message and a list of recipients for 
on^vard distnbution. 

15 3. Process according to claim 1 or claim 2 wherein said message and the list of recipients are 
contained in a package, the step of transmitting the electronic message to flie identified recipient 
comprising generating a new package comprising the electronic message and the first list, 

4. Process as claimed in claim 3 wherein the package is an XML document. 

20 

5. Process according to claim 4 wherein said electronic package comprises information defining 
the size and &e date of creation of said electronic message. 

6. Process according to any preceding cteim wherein the computation of said first and second 
25 sublist results in the generation of sublists of similar sissc. 



7, Process as claimed in any preceding claim wherein the first and second sublists together 
comprise all the recipients listed in the list of recipients from which they are extracted in step (a), 

30 8. Process according to any preceding claim wherein said transmission of the electronic 
message is perfomied via a Hyper-Text Transfer Protocol link. 

9. Process as claimed in any preceding claim wherein the itemted steps (a) to (d) are carried out 
in a single node of the network. 

35 

1 0- Process for distributing an electronic message associated with a list of recipients through a 
network comprising carrying out a process as claimed in any preceding claim in a plurality of nodes 
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of the network upon receipt at each node of the message and an associated first sublist generated in 
another of the nodes until the message has been transmitted to all recipients on the list. 



11- Process for distributing an electronic package received from a telecommunication network, 
5 said package comprising an electronic message accompanied by a list of recipients, said process 
comprising: 

- a) extracting (52) said list of recipients; 

- b) determining (53) whettier said list of recipients includes at least two items and, if so; 
10 - c) generating (54) a first and a second sxib}ist of recipients extracted from said list; 

- d) identifying (55) one particular recipient from said second list which is reachable via said 
telecotmnunication network; 

- d) generating (56) a first subpackage comprising said electronic document or file with said first 
Bublis^ 

15 - e) transmitting (57) said first subpackage to said identLBed particular recipient; 

- applying steps b-f to said second sublist. 

12. Process according to claim 11 wherein said package is an XML document comprising said 
electronic document or file and a list of recipients for said transmission, said XML document 
20 comprising iiiformation defining the size and Ihe date of creation of said electronic document or file. 

13- Process according to claim 1 1 wherein the size of the first suhUst is fixed to be equal to the 
mtegcr which is hnmcdiately sup^ior than half the number of items of tiie list of recipirats. 

25 1 4. Process according io claim 1 1 wherein the compulation of said first and second subhst results 
in the generation of sublists of similar size. 

15. Process according to claim 11 wherein said transmission of said first subpackage is 
perfbimed via a Hyper-Text Transfer Protocol link. 

30 

16- Publishing agent in the form of a computer program having program code elements for 
carrying out the process defined in any of claims I to 10- 

17. Computer program product comprising program code elements for distribution or fiooding 
35 of an electronic file or document through a telecommunication network, and arranged to execute the 
steps of; 
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- generating (41) a package comprising said electronic docmnent or file accompaxxied "by a list of 
recipients; 

- transmitting (43) said package to a first neoiptent; said first recipient extracting said list of recipients 
and computing a first snbiist and a second sublist of recipients; 

~ generating (54) a first subpackage comprising said electronic file or document with said first sublist 
and transmitting said first subpackage to a second recipient which is identified fiom said first sublist; 

- ^«;^iici£illu^ (:;S) a sccor^d T-ibpaclciigc: coinpricir.^ cnid ?!ectro".^r t^nMmenf with ftaid second 
sublist and processing again said second package by said first recipient 



10 18. Computer program product comprising program code elements for allowing distribution or 
flooding of one electronic file or document contained witliin a package including a list of recipients 
through a icJecommunication network, and arranged to execute the steps of; 



- receiving (5 1) said package comprising said electronic document or file accompanied with a list of 
15 recipients; 

- extracting fiom said package and said list of recipients a first sublist and a second sublist of 
recipients; 

- generating (54) a first subpackage comprising said electronic file or document vrith said first sublist 
and transmitting said first subpackage to a second recipient which is identified fiom said first sublist 

20 as being reachable; 

- generating (58) a second subpackage comprising said electronic file or document with said second 
sublist and processing again said second package by said first recipient. 

19- Computer program i>roduct in accordance wifli claim 16 embedded as a component of an 
^5 operating system. 

20. A data communications network wherein at least a subset of nodes comprise program code 
elements for carrying out a process as claimed in any of claims 1 to 10. 

30 
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Abstract 



Process for Data Distribntion thiroiis[li a Network 



5 



A process for distributiiLg through a network an eleclronic iiuc;ssage associated with a list of 



recipients, said process comprising the iterated application of the steps of: a) extracting from said list 



which can be addressed via said network, and c) transmitting said electronic message and said first 
sublist to said identified recipient for onward distribution; and d) applying said steps (a) to (d) to said 
10 second sublist of recipients. The process can also comprise receiving the message and a list of 
recipients for onward distribution. The above described process can thus be carried out in a plurality 
of nodes of the network upon receipt at each node of the message and an associated fh-st sublist 
generated in another of the nodes until the message has been transniitted to all recipients on the list. 




lipisr^tc; b) ii^Alfyizi*^ 9 rf^rA-mmt within said frrst SUblist 



15 Fig. 5 
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Agent 25 receives paetoge containing 1 
list of recipients via rooeive element 26 L 



Agent extracts tist of recipients 
from package 




3. 



YES 



A&ejit generates a firsx & 
second sublist of recipients 



Agent deterittines witliin first sublist j 
one recipient being reachable |_ 



Agent creates first subpaokage 
containing dQCXirnent+ sublistl 



Agent ti'ansinits said first $ubpackage 
to determined recipient 



nL_ 1 

Agent creates second subpaokage 
& transmits to receive element 26 _| 



Complete process 
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Fig. 5 
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